Improved Prediction of Tone Components for F0 Contour Generation of Mandarin Speech Based on the Tone Nucleus Model
نویسندگان
چکیده
Improved prediction of tone components was realized in our method for synthesizing sentence fundamental frequency (F0) contours of Mandarin speech. The method is based on representing a sentence logarithmic F0 contour as a superposition of tone components on phrase components as in the case of generation process model (F0 model). The tone components are realized by concatenating their fragments at tone nuclei predicted by a corpus-based method, while the phrase components are generated by rules under the F0 model framework. In the original method, tone components are assumed to have similar shapes as F0 contours at tone nuclei. This is based on the assumption that the phrase components are almost flat throughout an utterance. However, this is not the case especially for phrase component initials. To cope with this problem, parameters representing tone components of tone nuclei are modified. Also, predicted parameters in earlier processes are used for the prediction of following processes. Result of the listening test conducted for synthetic speech with the generated F0 contours by our methods and also by the HMM-based method confirmed the advantage of ours, especially the improved version.
منابع مشابه
Two-step generation of Mandarin F0 contours based on tone nucleus and superpositional models
A 2-step scheme was developed in our method for synthesizing sentence fundamental frequency (F0) contours of Mandarin speech. The method is based on representing a sentence logarithmic F0 contour as a superposition of tone components on phrase components as in the case of generation process model (F0 model). The tone components are realized by concatenating tone nucleus F0 patterns generated by...
متن کاملGeneration of F 0 Contours for Mandar in Speech in Combination with Rule-based and Corpus-based Methods
A method was developed for synthesizing sentence fundamental frequency (F0) contours of Mandarin speech. It is based on representing an F0 contour in logarithmic frequency scale as a superposition of tone components on phrase components as in the case of generation process model (F0 model). The tone components are realized by concatenating their fragments at tone nuclei predicted by a corpus-ba...
متن کاملGeneration of fundamental frequency contours for Mandarin speech synthesis based on tone nucleus model
A new method for generating sentence F0 contours of Mandarin speech is proposed. The method assumes the F0 contour generation process model, but generates the tone and phrase components in different ways and sums them to produce a sentence F0 contour. The tone component is generated concatenating F0 patterns of tone nuclei, which are predicted by a corpus-based scheme (binary decision trees). E...
متن کاملGeneration of fundamental frequency contours for Thai speech synthesis using tone nucleus model
As classic and intrinsic requirements, synthetic speech need to convey correct information with good quality of naturalness to listeners. Fundamental frequency (F0) contours need to be controlled to meet these requirements. Additional challenges have been introduced to tonal languages because the F0 contour reflects both intelligibility and naturalness of the speech. According to the fact that ...
متن کاملSubsyllabic Tone Units for Reducing Physiological Effects in Automatic Tone Recognition for Connected Mandarin
This paper presents our attempt to model physiological transition effect on syllable F0 contour in order to improve lexical tone recognition performance for Mandarin Chinese. We suggested that a syllable F0 contour consists of three segments: onset course, tone nucleus and offset course. Among the three segments, only tone nucleus contains key features for tone recognition, and the other two re...
متن کامل